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Abstract 



One of the strongest techniques available for showing lower bounds on quantum communi- 
cation complexity is the logarithm of the approximation rank of the communication matrix — 
the minimum rank of a matrix which is close to the communication matrix in l^o norm. This 
technique has two main drawbacks: it is difficult to compute, and it is not known to lower 
bound quantum communication complexity with entanglement. 

Linial and Shraibman recently introduced a norm, called 72 , to quantum communication 
complexity, showing that it can be used to lower bound communication with entanglement. 
Here a is a measure of approximation which is related to the allowable error probability of 
the protocol. This bound can be written as a semidefinite program and gives bounds at least 
as large as many techniques in the literature, although it is smaller than the corresponding a- 
approximation rank, rk^. We show that in fact log 72 (A) and logrkQ,(A) agree up to small 
factors. As corollaries we obtain a constant factor polynomial time approximation algorithm 
to the logarithm of approximate rank, and that the logarithm of approximation rank is a lower 
bound for quantum communication complexity with entanglement. 

1 Introduction 

Often when trying to show that a problem is computationally hard we ourselves face a compu- 
tationally hard problem. The minimum cost algorithm for a problem is naturally phrased as an 
optimization problem, and frequently techniques to lower bound this cost are also hard combina- 
torial optimization problems. 

When taking such an computational view of lower bounds, a natural idea is to borrow ideas 
from approximation algorithms which have had a good deal of success in dealing with NP-hardness. 
Beginning with the seminal approximation algorithm for MAX CUT of Goemans and Williamson 
IIGW95L a now common approach to hard combinatorial optimization problems is to look at a 
semidefinite relaxation of the problem with the hope of showing that such a relaxation provides a 
good approximation to the original problem. 

*Supported by a NSF Mathematical Sciences Postdoctoral Fellowship. Email: troyjlee@gmail.com 
^ Email: adi . shraibman @weizmann.ac.il 



1 



We take this approach in dealing with approximation rank, an optimization problem that arises 
in communication complexity. In communication complexity, two parties Alice and Bob wish to 
compute a function f : X xY ^ {"1; +1}' where Alice receives x G X and Bob receives y eY. 
The question is how much they have to communicate to evaluate f{x, y). We can associate to / 
a |X|-by-|y I communication matrix Mf where Mf[x, y] = f{x,y). A well-known lower bound 
on the deterministic communication complexity of / is logrk(Mj); the long standing log rank 
conjecture asserts that this bound is nearly tight in the sense that there is a universal constant c 
such that (log Tk(Mf)Y is an upper bound on the communication complexity of /. 

When we turn our attention to bounded-error communication complexity, where Alice and Bob 
are allowed to get the wrong answer with some small probability, the relevant quantity is no longer 
rank but approximation rank. For a sign matrix A, the approximation rank, denoted rka(A), is 
the minimum rank of a matrix A' such that J < A o A' < aJ, where J denotes the all ones 
matrix and o is entrywise product. Here a is a measure of approximation and is related to the 
allowable error probability of the protocol. It is known that i7(logrk2(M/)) is a lower bound on 
the bounded-error randomized communication complexity of / and the quantum communication 
complexity of / when the players do not share entanglement. In addition, approximation rank is 
one of the strongest lower bound techniques available for these models. In view of the log rank 
conjecture it is also natural to conjecture that a polynomial in the logarithm of approximation rank 
is an upper bound on the communication complexity of these models. 

As a lower bound technique, however, approximation rank suffers from two drawbacks. The 
first is that it is quite difficult to compute in practice. Although we do not know if it is NP-hard, 
many similar rank minimization problems are in fact NP-hard I1VB96II . The second drawback is 
that it is not known to lower bound quantum communication complexity with entanglement. 

In this note, we address both of these problems. We make use of a semidefinite programming 
quantity 72 which was introduced in the context of communication complexity by [LMSS07]. This 
quantity can naturally be viewed as a semidefinite relaxation of rank, and it is not hard to show that 
(^))^ ^ i'kQ,(yl). We show that this lower bound is in fact reasonably tight. 

Theorem 1 Let 1 < a < 00. Then for any m-by-n sign matrix A 

[a — 1)° 

72 (v4) can be written as a semidefinite program and so can be computed up to additive error e in 



time polynomial in the size of A and log(l/e) by the ellipsoid method (see [IGLS88II ). Thus The 



orem 1 gives a constant factor polynomial time approximation algorithm to compute logrkQ,(y4). 
Moreover, the proof of this theorem gives a method to find a near optimal low rank approximation 
in randomized polynomial time. 

Linial and Shraibman IILS07II have shown that log72 (Mj) ^ lower bound on quantum com- 
munication complexity with entanglement, thus we also obtain the following corollary. 

Corollary 2 Let < e < 1/2. Let Q*{A) be the quantum communication complexity of a m-by-n 
sign matrix A with entanglement. Then 

Q*e{A) > Trlogrkc,^ - ^loglog(mra) - 21ogae + log(ae - 1) - 0(1), 
o 2 
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where = j^^. 

The log log factor is necessary as the communication matrix of the n-bit equality function with 
communication matrix of size 2"-by-2" has approximation rank VL{\ogn) flAloOSII . but can be 
solved by a bounded-error quantum protocol with entanglement — or randomized protocol with 
public coins — with 0(1) bits of communication. 

Our proof works roughly as follows. Note that the rank of a m-by-n matrix A is the smallest 
k such that A can be factored as A = X'^Y where X is a k-hy-m matrix and F is a A;-by-n 
matrix. The factorization norm 72(A) can be defined as minxy ■.xty=ac{X)c(Y) where c(X) is 
the smallest £2 norm of a column of X. Let Xq, Yq be an optimal solution to this program so that 
all columns of Xo,Fo have squared £2 norm at most 72(A). The problem is that, although the 
columns of Xq, Yq have small £2 norm, they might still have large dimension. Intuitively, however, 
if the columns of Xq have small £2 norm but Xq has many rows, then one would think that many of 
the rows are rather sparse and one could somehow compress the matrix without causing too much 
damage. The Johnson-Lindenstrauss dimension reduction lemma IIJL84II can be used to make this 
intuition precise. We randomly project Xq and Yq to matrices Xi, Yi with row space of dimension 
roughly ln(mn)72 (A)^. One can argue that with high probability after such a projection XjYi still 
provides a decent approximation to A. In the second step of the proof, we do an error reduction 
step to show that one can then improve this approximation without increasing the rank of XfYi by 
too much. 



2 Preliminaries 

We will need some basic notation. For a vector u, we use for the £2 norm of u. For a matrix 
A, we use for the spectral or operator norm and \\A\\tr for the trace norm. The notation A0 B 
denotes tensor product, and Ao B denotes the entry wise product of A and B. 
We will use the following form of Hoeffding's inequality llHoe63ll : 

Lemma 3 ( IIHoe63ll ) Let G M, and 61, . . . ,6n be random variables with Pr[(5j = 1] = 

Fi[5i = -1] = 1/2. Then for any t > 

Pr 

Our main tool will be the factorization norm 72 [ITJ89II , introduced in the context of complexity 
measures of matrices by ILMSSOVII . This norm can naturally be viewed as a semidefinite program- 
ming relaxation of rank as we now explain. 

We take the following as our primary definition of 72: 

Definition 4 (l|Tj89|. IILMSS07ll ) Let A be a matrix. Then 

72(A) = min c(X)c(r), 

X,Y:XTY=A 

where c(X) is the largest I2 norm of a column of X. 



> t 



< 2 exp 



i=l 
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The quantity 72 can equivalently be written as the optimum of a maximization problem known 
as the Schur product operator norm: 72(^4) = maxx:j|x||=i ||^ ° The book of Bhatia (Thm. 
3.4.3 HBhaOV n contains a nice discussion of this equivalence and attributes it to an unpublished 
manuscript of Haagerup. An alternative proof can be obtained by writin g the opt imization problem 



defining 72 as a semidefinite programming problem and taking its dual flLSSOSJ. Using duality of 
the operator norm and trace norm it is then straightforward to see that the Schur product operator 
norm is equivalent to the expression given in the next proposition, which is most convenient for 
our purposes. A proof of this last equivalence can be found in [|Mat93ll . 

Proposition 5 (cf. [.LSSOS D Let A be matrix. Then 

72(^4)= max ||y4 o t>M-^||jj, 

||u||=|'|i,||=l 

From this formulation we can easily see the connection of 72 to matrix rank. This connection is 
well known in Banach spaces theory, where it is proved in a more general setting, but the following 
proof is more elementary. 



Proposition 6 (fTW^, IILSSOSl ) Let A be a matrix. Then 

12{AY 



rk(v4) > 



Proof: Let u, v be unit vectors such that 72(A) = || A o vu^Wtr- As the rank of A is equal to the 
number of nonzero singular values of A, we see by the Cauchy-Schwarz inequality that 



Tk{A) > 

As rk(y4 o vu^) < ik{A) we obtain 



I 4I|2 



I A l|2 



□ 



Finally, we define the approximate version of the 72 norm. 
Definition 7 ( I1LSO70 ) Let Abe a sign matrix, and let a>l. 



72"(A) = min 72(A') 

A':J<AoA'<aJ 



where J is the all ones matrix. 



4 



As corollary of Proposition [6] we get 
Corollary 8 Let Abe a sign matrix and a > 1. 

We will also use a related norm u known as the nuclear norm. 
Definition 9 ( HJam871| ) Let Abe a matrix. 

ij{A) = min I \ai\ : A = ^aixjy^j 

where Xi,yi e {-1,+1}". 

It follows from Grothendieck's inequality that i'{A) and 72(A) agree up to a constant multi- 
plicative factor. For details see nLS07L 

Proposition 10 (Grothendieck's inequality) Let Abe a matrix. 

l2{A)<y{A)<KGi2{A), 
where 1.67 < Kg < 1.78 ... z^' Grothendieck's constant. 

3 Main Result 

In this section we present our main result relating 72 (A) and rkQ,(y4). We show this in two steps: 
first we upper bound ikoL'{A) in terms of 72 (A) where a' is slightly larger than a. In the second 
step, we show that this approximation can be tightened to a without increasing the rank too much. 

3.1 Dimension reduction 

Theorem 11 Let Abe a m-by-n sign matrix and a > 1. Then for any < t < 1 

< 47?(^)Mn(4^n) 

1-t 

Proof: Instead of 72 we will work with the z/" norm which makes the analysis easier and is only 
larger by at most a multiplicative factor of 2. 

Thus let X^Y be a factorization realizing z/"(A) — in other words, J < X^Y o A < aJ, and 
all the columns of X, Y have squared £2 norm at most i'°'{A). Moreover, X, Y are layered in the 
sense that for every row, all elements in that row have the same magnitude. Indeed, balancing as 
necessary, we may assume that \X[i, j] \ = \Y[i, j] \ = y/j3i for all j, and that z/"(A) = ^ . 
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Say that X is a k-hy-m matrix and F is a k-hy-n matrix. Let Sij be independent identically 
distributed variables chosen to be +1 with probability 1/2 and —1 with probability 1/2. For an 
integer k' to be fixed later, we let i? be a A;-by-A;' random matrix where R[i, j] = 

LetXi = i?^XandYi = R^Y . Then clearly rk(XfFi) < k' . We now estimate the probability 
over choice of R that Xf Yi gives a good approximation to A. 

Consider 

k' 

k' / k \ / 



=1 \r=l 



,r=l 



fc' 

5^ X[r, z]Y[r, j] + Y,Yl ^[^' ^]^[^' ']^[^'' ^]^[^'' -^l 

£=1 r^r' 



Thus to bound the difference between X'^Y[i,j] and X'[Yi[i,j], it suffices to bound the magnitude 
of the second term. 



Pr 

R 



> t 



Pr 



< 2exp 



EE 

£=1 ri^r' 
-t^k' 



k' 



> t 



by Hoeffding's inequality ( [Lemma 3 ). 

By taking k' = 2i^"(y4)^ ln(4mr?,)/t^ we can make this probability less than l/2mn. Then by a 
union bound there exists an R such that 

\x''Y[^,J]-XlY^[^,J]\<t 

for all Rescaling the matrix XfYi gives the theorem. □ 



3.2 Error-reduction 

In this section, we will see that given a matrix A' which is an «' > 1 approximation to A, we can 
obtain a matrix which is a better approximation to A and whose rank is not too much larger than 
that of A' by applying a low-degree polynomial approximation of the sign function to the entries 
of A'. This technique has been used several times before, for example nAlo08[|KS07L 

Let p(x) = aQ + aix + . . . + UdX^ be a degree d polynomial. For a matrix A, we define p{A) to 
be the matrix aoJ + aiA + . . . + OrfA"'^ where y4°* is the matrix whose entry is j]'', and 
J is the all ones matrix. 
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Lemma 12 Let Abe a matrix and p be a degree d polynomial. Then rk(p(A)) < {d+ l)ik{AY 
Proof: The result follows using subadditivity of rank and that rk(A°*) < ik{A'^^) = Tk(Ay since 



In general for any constants 1 < (3 < a < oo one can show that there is a constant c such that 
rkj3{A) < rka(Ay by looking at low degree approximations of the sign function (see Corollary 1 
of [IKS07 1 for such a statement). As we are interested in the special case where a, P are quite close, 
we give an explicit construction in an attempt to keep the exponent as small as possible. 

Proposition 13 Fix e > 0. Let aa = 1/(2 + 6e + 4e^), and ai = 1 + 03. Then the polynomial 



maps [1, 1 + 2e] into [1, 1 + e] and [—1 — 2e, —1] into [—1 — e, —1]. 

Proof: As p is an odd polynomial, we only need to check that it maps [1,1 + 2e] into [1, 1 + e]. 
With our choice of 01,03, we see that p(l) = p(l + 2e) = 1. Furthermore, p{x) > 1 for all 
X G [1,1 + 2e], thus we just need to check that the maximum value of p{x) in this interval does not 
exceed 1 + e. 

Calculus shows that the maximum value of p{x) is attained at a; = i^^^Y^"^- Plugging this 
into the expression for p{x), we see that the maximum value is 



OS 



is a submatrix of A' 



□ 



p{x) = aix — a^x^ 



max p(x) 

a;e[l,l+2e] 




We want to show that this is at most 1 + e, or equivalently that 



2 V2 + 6e + 4e2 /3 + 6e + Ae^ 



.2\ 3/2 



< 1. 



One can verify that this inequahty is true for all e > 0. 



□ 



3.3 Putting everything together 



Now we are ready to put everything together. 



Theorem 14 Fix a > 1 and let Abe a m-by-n sign matrix. Then 



{a -I) 



8192a' 



ln^(4mn)7^(A)^ 
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Proof: We first apply 
obtain 



Theorem 11 



with t 



a-l 
2a 



. In this way, we have ^zf < 2« — 1. Thus we 



rk 



2a- 1 



{A)<-^^^HAmnh-{Ay. 
[a - IV 



Now we can use the polynomial constructed in [Proposition 13| and |Lemma 12| to obtain 

tK{A) < 2rk2„_i(A)3 < ^l^ln3(4mn)72"(A)^ 

[a — 1)° 



□ 



4 Discussion and open problems 

One of the fundamental questions of quantum information is the power of entanglement. If we 
believe that there can be a large gap between the communication complexity of a function with 
and without entanglement then we must develop techniques to lower bound quantum communica- 
tion complexity without entanglement that do not also work for communication complexity with 
entanglement. We have eliminated one of these possibilities in approximation rank. 

The strong relation between 72 (^) and rka{A) for constant a might be surprising in light of 
the behavior we encounter when a = 1 or when a goes to infinity. When a = 1 the n dimensional 
identity matrix provides an example where 72 is constant while the rank is equal to n. When 
a 00 the quantity ikodA) is the well-studied sign rank. Using a similar random projection 
technique, Ben-David et al. [IBES02II show that rkoo(A) < 0(\og{mn)'y^{AY) for a m-by-n 
matrix A. In this case, however, the lower bound fails as there are examples of an exponential gap 
between sign rank and 7^^ IIBVW07[IShe08ll . 

Back to the computational point of view, it remains an interesting open problem to find an 
efficient approximation algorithm for sign rank. More generally, one can ask about other kinds 
of rank minimization problems. We have just dealt with a very specific case where one has a 
constraint of the form Ao X > 0. One could look more generally at linear equality constraints of 
the form Ai • X = bi, . . . , Ak • X = bk- Such problems arise in many areas and are known to be 
NP-hard. 
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